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Abstract. The main theorem of this article is that every countable model of set theory 
(M, £ M ), including every well-founded model, is isomorphic to a submodel of its own con- 
structible universe (L M , G M ). In other words, there is an embedding j : M — > L M that is 
elementary for quantifier-free assertions. It follows from the proof that the countable models 
of set theory are linearly pre-ordered by embeddability: for any two countable models of 
set theory (M, G M ) and (A, 6^}, cither M is isomorphic to a submodel of A or conversely. 
Indeed, they are pre- well-ordered by embeddability in order-type exactly uj\ + 1. Specifically, 
the countable well-founded models are ordered under embeddability exactly in accordance 
with the heights of their ordinals; every shorter model embeds into every taller model; ev- 
ery model of set theory M is universal for all countable well-founded binary relations of 
rank at most Ord M ; and every ill-founded model of set theory is universal for all countable 
acyclic binary relations. Finally, strengthening a classical theorem of Ressayre, the same 
proof method shows that if M is any nonstandard model of PA, then every countable model 
of set theory — in particular, every model of ZFC plus large cardinals — is isomorphic to a 
submodel of the hereditarily finite sets (HF M , G M ) of M. Indeed, (HF M , E M ) is universal 
for all countable acyclic binary relations. 

1. Introduction 

In this article, I shall prove that every countable model of set theory (M, 6 ), including 
every well-founded model, is isomorphic to a submodel of its own constructible universe 
(L M , e M ). Another way to say this is that there is an embedding 

j : (M, G M > -> (L M , e M ) 

that is elementary for quantifier-free assertions in the language of set theory. 

Main Theorem 1. Every countable model of set theory (M, E M ) is isomorphic to a submodel 
of its own constructible universe (L M ,G M ). 

The proof uses universal digraph combinatorics, including an acyclic version of the count- 
able random digraph, which I call the countable random Q-graded digraph, and higher ana- 
logues arising as uncountable Fraisse limits, leading eventually to what I call the hypnagogic 
digraph, a set- homogeneous, class-universal, surreal-numbers-graded acyclic class digraph, 
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which is closely connected with the surreal numbers. The proof shows that (L M , G M ) con- 
tains a submodel that is a universal acyclic digraph of rank Ord M , and so in fact this model 
is universal for all countable acyclic binary relations of this rank. When M is ill-founded, 
this includes all acyclic binary relations. The method of proof also establishes the following, 
thereby answering a question posed by Ewan Delanoy [Delll]. 

Main Theorem 2. The countable models of set theory are linearly pre-ordered by embed- 
dability: for any two countable models of set theory (M, G M ) and (N, <E N ), either M is 
isomorphic to a submodel of N or conversely. Indeed, the countable models of set theory are 
pre- well- ordered by embeddability in order type exactly ou 1 + 1. 

The proof shows that the embeddability relation on the models of set theory conforms with 
their ordinal heights, in that any two models with the same ordinals are bi-embeddable; any 
shorter model embeds into any taller model; and the ill-founded models are all bi-embeddable 
and universal. 

The proof method arises most easily in finite set theory, showing that the nonstandard 
hereditarily finite sets HF M coded in any nonstandard model M of PA or even of IA 
are similarly universal for all acyclic binary relations. This strengthens a classical theorem 
of Ressayre, while simplifying the proof, replacing a partial saturation and resplendency 
argument with a soft appeal to graph universality. 

Main Theorem 3. If M is any nonstandard model of PA, then every countable model of set 
theory is isomorphic to a submodel of the hereditarily finite sets (HF M , G M ) of M. Indeed, 
(HF M , G M ) is universal for all countable acyclic binary relations. 

In particular, every countable model of ZFC and even of ZFC plus large cardinals arises 
as a submodel of (HF M , G M ). Thus, inside any nonstandard model of finite set theory, we 
may cast out some of the finite sets and thereby arrive at a copy of any desired model of 
infinite set theory, having infinite sets, uncountable sets or even large cardinals of whatever 
type we like. 

A structure M is universal for a class A of structures, if every structure in A is isomorphic 
to a substructure of M. In this article I use the term model of set theory to mean a first-order 
structure (M, G M ) satisfying at least the Kripke-Platek KP axioms of set theory, a very weak 
fragment of ZF. Although typically for the models of set theory I have in mind the models 
of ZFC or even ZFC plus large cardinals, it turns out that the results of this article go 
through for the much weaker theory KP, and indeed still weaker theories will suffice, but for 
definiteness I shall officially use KP, leaving the determination of how weak we may go for 
a later project. 

The language of set theory has only the set-membership relation G, and so a submodel of 
a model (M, G M ) of set theory is simply a subset N C M, where one restricts the relation 
to form (N,<E M \ N). An embedding of one model (M, G M ) into another (N, <E N ) is an 
isomorphism of M with a submodel of N, that is, a function j : M — >■ N for which x G M y 
if and only if j(x) (i N j(y). Since a model of set theory is a set with a binary relation, it is 
technically a certain special kind of directed graph, and many of the arguments of this article 
will proceed from this graph-theoretic perspective. From this perspective, a submodel of a 
model of set theory, viewed as a directed graph, is the same as an induced subgraph. 

The three main theorems reappear in this article as theorems 29, 33 and 9, respectively. 
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2. The countable random Q-graded digraph 

The main theorems will be proved by finding copies of certain universal digraphs among 
the submodels of the models of set theory under consideration. So let me begin by developing 
a little of this universal digraph theory. A digraph, or directed graph, is a structure (G, — 
where G is a set of vertices, or nodes, and — 1 is a binary relation on G. A digraph is acyclic 
if there is no finite directed path from a vertex to itself. That is, an acyclic digraph is one 
with no directed cycles. Note that the undirected version of an acyclic digraph may be far 
from a tree, since there can be undirected cycles. 

Let Q be the endless dense linear order of the rational numbers. A digraph G is Q-graded 
if every vertex a in G is assigned a rational value q a in such a way that q a < q\, whenever 
a — b. Such a graph must be acyclic, since the values increase along any directed path. 
More generally, for any linear order t we may consider the ^-graded directed graphs, and 
these also are acyclic. 

Lemma 4. Every countable acyclic digraph can be Q-graded by a suitable value assignment. 

Proof. Suppose that G is a countable acyclic digraph. Let < be the reachability relation on 
G, the transitive closure of the edge relation of G. Because G is acyclic, it follows that < is 
a partial order on the vertices. Since every partial order extends to a linear order, and every 
countable linear order embeds into Q, it follows that we may assign the vertices of G to 
rational numbers in such a way that a <\b implies q a < In particular, this is a Q-grading 
ofG. ' □ 

Although the value assignment arising in the proof of lemma 4 is one-to-one, in general 
a Q-graded digraph may have many nodes with the same value, and the universal graded 
digraphs appearing later in this article will have infinitely many nodes of each value. The 
grading of a digraph is in effect a laying-out of its vertices on levels, in such a way that every 
directed edge points from a lower-level node to a higher-level node. 

An alternative grading concept would regard a graded digraph (G,—^,<) as a digraph 
(G, — together with a linear pre-order < on the vertices whose corresponding strict order 
includes the edge relation, so that a — b implies a < b. This formalization founds the 
graded digraph concept in a finite first-order language, but in effect allows embeddings and 
isomorphisms to act also on the linear order, preserving merely the relative value rather 
than the exact value; but I shall not need this, and so in this article I have opted for the 
value-assignment concept of grading above and insist that graded digraph embeddings pre- 
serve these values. A structure A is homogeneous if every isomorphism of finitely generated 
substructures of A extends to an automorphism of A. 

Theorem 5. There is a countable homogeneous Q-graded digraph T, which is universal for 
all countable Q-graded digraphs. Furthermore, 

(1) There is a unique such digraph up to isomorphism, even when restricting universality 
to finite Q-graded digraphs. 

(2) Every countable acyclic digraph arises as an isomorphic copy of an induced subgraph 
ofT. 

(3) T admits a computable presentation. 



We call this T the countable random Q-graded digraph. 
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Proof. There are several independent constructions of this highly canonical object. Let me 
begin with the abstract realization of T as a Fraisse limit. Let Q be the collection of finite de- 
graded digraphs. Note that Q enjoys the hereditary property, that any induced subgraph of 
a graph in Q is in Q, and also the joint embedding property, asserting that any two members 
of Q embed into a third. Furthermore, Q has the amalgamation property, meaning that 
whenever A, B and C are in Q, where A embeds into B and also into C, then there is a D 
into which both B and C embed while agreeing on the image of A. These three properties 
are sufficient that Q has a Fraisse limit T, a countable, homogeneous, directed Q-graded 
graph, whose finite induced subgraphs are up to isomorphism precisely the graphs in Q (see 
[Hod93, thm 7.1.2]). The universal property of the Fraisse limit ensures that T is universal 
for all countable acyclic digraphs, because any such graph G is Q-graded by lemma 4 and 
since G is the direct limit of its finite induced subgraphs, each of which is in Q, it follows 
that one may amalgamate the limit construction of G as occurring inside T. 

Consider alternatively the following construction of T via forcing, which amounts essen- 
tially to the forcing construction of the Fraisse limit. Let P be the partial order of all finite 
Q-graded digraphs with nodes coming from a fixed countably infinite set, ordering the di- 
graphs of P by the induced subgraph relation. If A C B and C are any such finite Q-graded 
digraphs, with an embedding / : A — > C, then let Da,b,cj be the set of graphs H G P such 
that if C C H, then / extends to an embedding g : B — > H . This is a dense collection of con- 
ditions H, since for any Hq G P, if C C Hq then we can extend H to include a copy of B over 
f(A), and thus extend / to g : B — > H, as desired. Since there are only countably many such 
dense sets, we may build a chain of graphs H C Hi C • • • such that T = {J n H n meets every 
such dense set. It follows that T is weakly homogeneous, which means that whenever A C B 
and / : A — > F, then there is an extension to g : B — > F. This implies that T is universal for 
all countable Q-graded digraphs, since any such graph G is the union \J n G n of finite graphs, 
and weak homogeneity allows us to build up a chain of embeddings /„ : G n — > T, which 
realizes G as an induced subgraph of T. It also implies that T is fully homogeneous, by a 
similar back-and-forth argument that systematically extends a finite partial isomorphism to 
a full automorphism of T. 

Let me now describe a more concrete construction, which will realize T as a computable 
directed Q-graded graph. We build T as the union of a sequence of finite directed Q-graded 
graphs r„ G Q. Namely, let T be the empty graph, and at each stage n, we consider all 
possible ways to extend T n by the addition of one vertex with a value amongst the first 
n rational numbers, and let r„ + i realize all such possibilities. Since we can carry out this 
process by a computational procedure, the limit T = [j neuJ T n is a computable directed Q- 
graded graph. This graph T is universal for all countable directed cycle-free graphs G, since 
any such G can be Q-graded, and then we can build up the embedding of G into T in stages. 
Namely, we enumerate the vertices of G as v , Vi, and so on, and at stage n we have a copy of 
the first n vertices of G inside some r fcn . The next vertex v n has a certain value and exhibits 
a certain pattern of connectivity to the prior vertices of G. At a stage in the construction 
of T where the value q n of v n is allowed, we add a vertex exhibiting precisely that pattern 
of connectivity over the copy of the prior vertices that we have built in V. Thus, we may 
extend our embedding to include a copy of v n with the correct value. By continuing this 
process, we end with an embedding of G into T, as desired. 
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Finally, let me give a probabilistic account of T. Let T have infinitely many vertices of 
every rational value, and for each pair of nodes a and b with a having lower value than b, 
flip a coin to determine whether we place an edge between them. Thus, T is the random 
Q-graded directed graph. With probability one, every finite Q-graded directed graph arises 
as an induced subgraph, and furthermore, any particular finite subgraph A will find with 
probability one an extension inside T realizing any particular pattern of connectivity for a 
new vertex having a particular value (subject to the constraints on values). Thus, almost 
surely the resulting graph T will exhibit the finite-pattern property that allows us to extend 
any partial isomorphism via a back-and-forth argument to a full automorphism. □ 

The need to consider graded digraphs, rather than mere digraphs, arises from the fact 
that there simply are no nontrivial homogeneous acyclic digraphs: 

Observation 6. // an acyclic digraph G has a — b — c as an induced subgraph, then it is 
not homogeneous as a digraph (without any grading structure). 

Proof. Suppose G is an acyclic digraph with a — 1 b — 1 c as an induced subgraph, meaning 
here that a and c share no edge. It follows that the subgraph {a, c} is discrete and therefore 
has an automorphism swapping a and c. But this finite automorphism cannot extend to an 
automorphism it of the whole graph, since this would cause a directed cycle: a — ^ b c = 
7i(a) — 1 7r(6) 7r(c) = a, a contradiction. □ 

Essentially the same point is made by observing that the collection of acyclic digraphs does 
not enjoy the amalgamation property, since the discrete graph with two vertices embeds into 
a — b — c in two ways, but these cannot be amalgamated in any acyclic graph, since any 
amalgamation would give rise to a directed cycle. 

The purpose of using graded digraphs is to overcome the obstacle posed by observation 
6 and enable the theory of universality and homogeneity in the context of acyclic digraphs. 
The situation here is that the countable random Q-graded digraph is homogeneous as an 
Q-graded digraph, where the automorphisms respect the edge relation and the values of the 
Q-grading, but not as an unlabeled digraph, where the automorphism need only respect the 
edge relation. Note that for a — b c in the case of an Q-graded graph, the value of a must 
be strictly smaller than the value of c, which exactly prevents us in swapping a and c when 
we must preserve the values. Meanwhile, if one forgets both the values and the direction of 
edges, then in fact the undirected graph underlying the countable random Q-graded digraph 
is the same as the countable random graph. 

Observation 7. The graph obtained by ignoring the values and direction of the edges in the 
countable random Q-graded digraph is isomorphic to the countable random graph. 

Proof. It suffices to show that the graph underlying the countable random Q-graded digraph 
T exhibits the finite-pattern property characterizing the countable random graph, namely, 
that for any two disjoint sets A and C, there is a node connected to every edge in A and to no 
edge in C. Such a node exists in T by the universality and homogeneity of T: simply select 
a rational value not used by any node in A and observe that there is a larger graded graph 
realizing the desired pattern with respect to A and C and having that value; by universality 
there is a copy of this digraph inside T; by homogeneity, we may translate this copy to align 
with the given nodes of A and C. So the undirected ungraded digraph underlying T has 
the finite-pattern property, and consequently is isomorphic to the countable random graph, 
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since the countable random graph has this property and a simple back-and-forth argument 
shows that any two countable graphs with this property are isomorphic. □ 



3. A STRENGTHENING OF RESSAYRE'S THEOREM 

Every model of arithmetic comes along with a model of finite set theory via the Ackerman 
coding, obtained by defining the following relation on the natural numbers: 

n E m <^ the n th binary bit of m is 1. 

In the standard model N, this relation is well-founded and extensional, and it is an often- 
assigned elementary exercise to prove that the Mostowski collapse of E is precisely the 
hereditary finite sets. 

(N, E) (HF, e) 

The relation E can be defined inside any model of arithmetic M, leading to the corresponding 
model of finite set theory (HF M ,G M ), and conversely, the natural numbers of HF M are 
isomorphic to M again. In this way, the models of arithmetic are in a natural one-to-one 
correspondence with the models of finite set theory (properly understood), and the two 
theories are bi-interpretable. Specifically, [KW07] shows that the models of PA give rise 
to all the models of the finite set theory ZFCT 00 = ZFC — Inf + -Jnf + TC, where TC is 
the assertion that every set has a transitive closure, and conversely the natural numbers 
of any such model satisfies PA, in inverse fashion, making for a bi-interpretation of these 
two theories. The curious anomaly here is that while the axiom TC is provable and may 
be omitted if one replaces the usual foundation axiom in ZFC with the axiom scheme of e- 
induction, nevertheless results in [ESV11] show, quite interestingly, that TC is not provable 
if one uses only the usual foundation axiom. 

The theme of this article begins in earnest with the following remarkable theorem of 
Ressayre's. 

Theorem 8 (Ressayre [Res86]). If M is any nonstandard model of PA, with (HF M , E M ) the 
corresponding nonstandard hereditary finite sets of M, then for any consistent computably 
axiomatized theory T extending ZF in the language of set theory, there is a submodel N C 
(HF M , e M ) such that N |= T. 

In particular, we may find models of ZFC and even of ZFC + large cardinals as submodels 
of HF M , a structure whose theory is all about the finite and which thinks every object is finite. 
But nevertheless inside HF M there are submodels of the strongest infinitary theories we know! 
Incredible! How can this be? Ressayre's proof uses partial saturation and resplendency to 
prove that one can find the submodel of the desired theory T. 

Theorem 9 strengthens Ressayre's theorem, while simplifying the proof, by replacing the 
use of resplendency with a soft appeal to digraph universality. In particular, the role of 
the theory T is omitted: we need not assume that T is computable, and we don't just 
get one model of T, but rather all countable models of T, for the theorem shows that the 
nonstandard models of finite set theory are universal for all countable acyclic binary relations. 
In particular, every model of set theory arises as a submodel of (HF M , E M ). 
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Theorem 9 (Main theorem 3). If M is any nonstandard model of PA or indeed merely of 
I E\, then every countable model of set theory is isomorphic to a submodel of the hereditarily 
finite sets (HF M , G M ) of M. Indeed, (HF M , G M ) is universal for all countable acyclic binary 
relations. 

One may equivalently cast the theorem entirely in terms of finite set theory: every non- 
standard model (M, G M ) of finite set theory ZFC"°° is universal for all countable acyclic 
binary relations. In particular, every model of set theory, of ZFC or whatever theory, is 
isomorphic to a submodel of (M, G M ), even though this latter model believes that every set 
is finite. 

In order to prove theorem 9, we begin with a simple link between acyclic digraphs and 
sets with the set-membership relation. 

Lemma 10. Every finite acyclic digraph is isomorphic to a hereditarily finite set with the 
set-membership relation G. 

Proof. Suppose that G is a finite acyclic digraph with n vertices v±,. . . v n . We first extend G 
to a larger finite extensional digraph G + as follows. Let N be the transitive closure of the 
digraph 

0^1^ . n> 

which is the same as (n + 1, g), if one uses the von Neumann ordinals as natural numbers, 
and let G + be the graph containing the disjoint union of G and N, together with edges 
k — v k for k > 1. Note that (i) any two nodes of G have different predecessors in N; (ii) 
different nodes of N have different predecessors in N; and (iii) every node in N has as a 
predecessor, but no node in G has as a predecessor. Thus, no two nodes of G + have the 
same predecessors and so G + is extensional. Furthermore, since G and iV are both acyclic 
and we added only edges pointing from iV to G, it follows that G + is acyclic. In other 
words, the edge relation of G + is an extensional well-founded relation and therefore by the 
Mostowski collapse tt(v) = { n(w) \ w — 1 v } is isomorphic to a unique transitive set under 
the set membership relation G. 

w — 1 v if and only if n(w) G ir(v). 

This isomorphism associates each node in G + with the set of sets associated with its children. 
The isomorphism n carries the directed graph ((?,—*■) to the hereditarily finite set (A, g), 
where A = { ir(v) | v G G }, as desired. □ 

Note that this lemma is provable in finite set theory ZFCT 00 . It will be generalized by 
lemma 21, which handles the general case of well-founded digraphs. 

Proof of theorem 9. Suppose that M is a nonstandard model of PA, and that (HF M , G M ) 
is the corresponding nonstandard model of finite set theory ZFCT 00 built via the Ackerman 
coding in M. Consider the computable construction of the countable random Q-graded 
digraph T, as carried out inside M. Let A; be a nonstandard integer of M, and consider 
the k th stage of this construction Tff . Since the standard stages of construction will be the 
same inside M as externally, it follows that the standard part of is the actual countable 
random Q-graded digraph T itself. That is, T is an induced subgraph of Tff . Since M thinks 
Tfc is finite, we may apply lemma 10 inside M to find a set A G HF M such that M thinks 
(r fc , — = (A, G M ). Since T is an induced subgraph of Tff , we may restrict this isomorphism 
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(externally) to find (r, = (B, G M ) for some B C { a G HF | M |= a G A }. Thus, we 
have realized the countable random Q-graded digraph as isomorphic to a collection of sets 
in HF M . But since T is universal among all countable acyclic digraphs, it follows that every 
countable set with an acyclic binary relation is isomorphic to an induced subgraph of T, and 
hence to a subcollection of HF M under G M , as desired. 

Lastly, let me discuss the issue of weakening the theory PA to IAq, which allows induc- 
tion only for A assertions, and even down to IEi, which has induction only for bounded 
existential assertions. The reason is that every countable nonstandard model of IE\ has an 
initial segment that is a model of PA, by a result of Paris [Par84] , whose proof relies on the 
earlier corresponding result for IAq, proved by McAloon [McA82]. So if we begin with a 
model M \= IE\, therefore, we may simply cut down to the PA initial segment M \= PA, 
where we will find the universal structure by the argument of the previous paragraph. Thus, 
every model of an acyclic binary relation embeds into (HF M °, G M °), which is an initial seg- 
ment of (HF M , G M ), and so every model of an acyclic binary relation embeds into HF M , as 
desired. □ 

Corollary 11. The countable models of set theory, up to isomorphism, have a universal 
object under the submodel relation. Namely, if (M, G M ) is any u -nonstandard model of set 
theory, then every countable model of set theory is isomorphic to a submodel of (M, G M ) . 

Proof. Indeed, theorem 9 shows that every countable model (N, E) of any binary relation 
embeds already into the hereditary finite sets (HF M , G M ) of M. □ 

I will show later that in fact every countable nonstandard model of set theory is universal 
in this way, even if the ill-foundedness appears only higher than u>. 

4. ^-GRADED DIGRAPHS AND THE FINITE- PATTERN PROPERTY 

It will be convenient to consider grading notions other than Q, aiming eventually at the 
well-founded acylic digraphs, which are precisely the a-graded digraphs for some ordinal a, 
or in the case of proper classes, the Ord-graded digraphs, as well as the No-graded digraphs, 
using the surreal numbers No. To assist with this analysis, consider the general case of an 
arbitrary linear order £ and the ^-graded digraphs G, which have a value assignment v (->■ a v 
of the nodes v G G with elements a v G £ in such a way that v — w in G implies a v < a w 
in £. Any such graph is acyclic, since a directed cycle in G would give rise to a violation of 
asymmetry in £. 

Theorem 12. For any countable linear order £, there is a countable homogeneous £-graded 
digraph that is universal for all countable £-graded digraphs. This digraph is unique up to 
isomorphism (even when restricting universality to the finite £-graded digraphs), and has a 
computable presentation if £ is computable. 

Proof. The collection Q e of all finite ^-graded directed graphs has the hereditary property, 
the joint embedding property and the amalgamation property, and consequently admits 
a Fraisse limit, which is up to isomorphism the unique countable, homogeneous, ^-graded 
digraph of its age. This graph is universal for all countable ^-graded directed graphs by the 
same argument as in theorem 5. 

Alternatively, it may be easier simply to view £ as a suborder of the rational order Q, 
and then consider the restriction F \ £ oi the countable random Q-graded digraph T to the 

8 



vertices having a label in £. The subgraph r \ £ is ^-graded and inherits the homogeneity 
property from T, since any finite partial automorphism of T \ £ is also a finite partial 
automorphism of T, which therefore extends to an automorphism of T, which respects the 
labels and thus provides an automorphism of T \ £. The digraph T \ £ remains universal for 
^-graded directed graphs, because any such graph embeds into T in a way that uses only the 
labels in £ and hence embeds into T \ £. 

If £ is computable, then £ has a computable copy inside Q, and using the computable 
presentation of T from theorem 5, we obtain a computable presentation T \ £, as desired. □ 

Let us call this graph the countable random £-graded digraph. There is also a finite-pattern 
property for these graphs analogous to the w-categorical first-order characterization of the 
countable random graph, mentioned in observation 7. 

Definition 13. An ^-graded digraph Y satisfies the finite-pattern property, if for any disjoint 
finite sets of vertices A, B and C and any a G £, such that every vertex in A has value less 
than a and every vertex in B has value greater than a, then there is a vertex v in T with 
value exactly a such that a — v and v — 1 b for every a e A and b E B, but v has no edges 
with any vertex in C. 



a 




Figure 1 . The finite-pattern property 

Equivalently, an ^-graded digraph has the finite-pattern property just in case it is existen- 
tially closed in the class of all ^-graded digraphs. 

Theorem 14. If T is a countable £-graded digraph for a countable linear order £, then the 
following are equivalent: 

(1) T has the finite-pattern property. 

(2) T is isomorphic to the countable random £-graded digraph. 

Proof. It is clear that the countable random ^-graded digraph has the finite-pattern prop- 
erty, since each instance with particular A, B and C is clearly realizable in a finite ^-graded 
digraph, which must therefore embed into the countable random ^-graded digraph by uni- 
versality, thereby providing a realizing node v by homogeneity. 

Conversely, suppose that T is a countable ^-graded digraph with the finite-pattern property. 
It is clear by successive applications of this property that every ^-graded directed graph 
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occurs as an induced subgraph of T. Now, one simply applies a back-and-forth construction 
to realize that T is isomorphic to the countable random ^-graded digraph. At each stage, one 
applies the finite-pattern property either in T or in the countable random ^-graded digraph 
to extend the isomorphism one more step. □ 

Although the random digraphs that we have constructed are homogenous as Q-graded 
or as ^-graded digraphs, as in observation 6 they are definitely not homogeneous merely as 
digraphs, without the extra structure provided by the value assignment. Meanwhile, if one 
should forget the values and the directionality of the edges, then the argument of observation 
7 shows that the graph underlying the countable random ^-graded digraph, for any infinite 
linear order £, is isomorphic to the countable random graph. 

5. Uncountable homogeneous graphs and structures 

The main argument of subsequent sections will make use of uncountable homogeneous 
graded digraphs, and it will be convenient to have a general theory of how to construct 
such structures. So let us briefly review that theory here, beginning with the uncountable 
analogue of the Fraisse limit construction. 

Although the Fraisse limit construction is typically given for the countable case, where 
one produces a countable homogeneous structure as a limit of finitely generated structures, 
the idea generalizes naturally to uncountable structures, and I shall give a brief account here 
of how this can be done. 

For any first-order structure D, the age of D is the class of finitely generated structures 
that embed into D, that is, which are isomorphic to a substructure of D, and for any cardinal 
<5, the 5 -age is the class of <5-generated structures that embed into D. A structure D is 5- 
homogeneous if every isomorphism of two <5-generated substructures of D extends to an 
automorphism of D. 

Theorem 15. Suppose that K is a family of structures in a language C of size at most 5, 
an infinite cardinal for which 5 <s = 5, such that K is closed under isomorphism, contains 
at most 5 many isomorphism types, and every member of K is finitely generated. Suppose 
further that K exhibits the the following properties: 

HP (hereditary property) Every submodel of a model in K is in K . 

JEP (joint embedding property) If A, B E K there is C G K with A and B both embedding 
into C . 

AP (amalgamation property) For any A,B,C G K, if e : A — >■ B and f : A — > C are 
embeddings, then there is D G K with embeddings g : B —t D and h : C — >■ D such 
that ge = hf . 

Then there is a 5-homogeneous structure A of size 5 whose age is exactly K . This structure is 
unique up to isomorphism and universal among all structures of size 5 whose age is contained 
in K . 

Proof. See Hodges [Hod93, thm7.1.2] for an account of the countable case 5 = u. Assume 
the class K has HP, JEP and AP. Let K + be the class of «5-generated structures whose 
age is contained in K. Every element of K + is the direct limit of its finitely generated 
submodels, and these are all in K. Thus, K + is simply the closure of K under limits of 
directed systems of size less than 5. Clearly, K + retains the HP. In fact, K + also enjoys 
the JEP and the AP, which we prove jointly by induction on the number of generators for 
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the structure involved. For the JEP, consider two structures A,B G K + , where A is 7- 
generated and i? is ^-generated, where £ < 7. The case of finite 7 follows from the JEP 
of K. Otherwise, assume 7 is an infinite cardinal and express A as the union of a chain 
A = |Ja< 7 A a of <7-generated structures. By the AP induction hypothesis applied to A a 
and B, we may construct a chain Co C • • • C C a C • • • for a < 7, such that A Q and B 
embed into C a , and the embedding of A a into C a extends to the embedding of A a > into 
C a i for a < a'. The union C = [j a C a jointly embeds both A and B. A similar argument 
establishes the AP for K + . In addition to HP, JEP and AP, the class K + also enjoys the 
<5-limit property («5-LP), which asserts that if each A a G K + and 

AoC^C-.C^C... 

is a chain of length less than 5, then |J Q A a G 

Now, in a forcing-style presentation, let P be the structures in K + whose underlying set 
is contained in 8. (In the case that the language has 5 many constant symbols, then let us 
also fix a bijection 5 = 5x5 and insist furthermore that the /9-generated structures of P live 
in the part of 5 corresponding to x 5; this will ensure that there is sufficient extra room to 
expand the structure.) Every structure in K + is isomorphic to an element in P. We order P 
by the submodel relation, and consider it as a forcing notion, for which the larger models are 
stronger. Note that P is closed under the unions of chains of size less than 5. The fact that 
K + has the JEP means that for any A G K + the collection of conditions B G P such that A 
embeds into B is dense in P. The fact that K + has the AP means that for any A, B, C G P 
such that A C B and / : A — >■ C, the set of conditions DgP such that CCD4>/ extends 
to some g : B — >■ D, is dense below C in P. Since 5 <s = 5, there are only 5 many conditions 
in P and thus only 5 many dense sets of the type I have just mentioned in the previous 
sentences. Since P is also closed under the unions of chains of length less than 5, we may by 
build a length 5 chain of structures 

Do C • • • C D a C • • • 

from P in such a way so as to meet all these dense sets. It follows that the union D = \J a<s D a 
has size 5 and contains isomorphic copies of every A G K + . Further, the second family 
of dense sets exactly ensure that the structure D is 5-weakly homogeneous, meaning that 
whenever A C B and / : A — > D is an embedding, where A, B G K + , then / extends to 
some g : B — >■ D. It follows that D is universal for any structure B of size 5, whose age is 
contained in K. To see this, first express B = [J a<s B a as a union of structures B a G K + , and 
then build a coherent system of embeddings f a : B a — >■ D, applying the 5-weak homogeneity 
property of D to extend at successor stages. The result is an embedding of B into D, as 
desired. A similar back-and-forth argument shows that D is ^-homogeneous, and uniqueness 
follows from this. □ 

Note that the «5-limit property may be equivalently formulated as the assertion that the 
class is closed under direct limits of linear systems from K of length less than 5. 

A class structure is set-homogeneous if every isomorphism between two set substructures 
extends to a class automorphism of the whole structure. The global choice principle asserts 
that there is a proper class choice function selecting an element from every set. This is 
equivalent to the assertion that there is a proper class well-ordering of the universe in order 
type Ord. The axiom is most naturally stated in the second-order Godel-Bernays set theory, 
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which allows for classes that may not be definable, and so the global choice axiom does not 
necessarily imply V = HOD. 

Theorem 16. Assume the global choice principle. Suppose that K is a proper class of finitely 
generated models in language C, which may be a proper class, and that K is closed under 
isomorphic copies and exhibits the HP, JEP and AP. Then there is an set-homogeneous class 
model A, unique up to isomorphism, whose age is exactly K. Furthermore, every class model 
whose age is contained in K is embeddable into A. 

Proof. This is simply a proper class version of theorem 15. One sets up the class partial order 
P consisting of all set-generated structures, built on the ordinals, whose age is contained in 
K, and order them by induced submodel as before. The point is that this class forcing is set- 
closed, just as the forcing notion in theorem 15 was <5-closed, and so using the global choice 
principle, one may construct an Ord-length sequence of structures that meet all the dense 
classes corresponding to those in the proof of theorem 15. The result is a class structure 
A that is set-homogeneous and universal for all class structures whose age is contained in 
K. □ 



I would like now to construct canonical homogeneous ^-graded digraphs for an arbitrary 
linear order £, perhaps uncountable. Generalizing definition 13 to the uncountable, we say 
for a cardinal 8 that an ^-graded digraph G exhibits the <5-pattern property if for any disjoint 
sets A, B and C of nodes, with cardinality less than 5, and any a G £ for which every node 
in A has value less than a and every node in B has value greater than a, there is a node 
«6 A with value exactly a such that a — 1 v and v — 1 b for all a e A and b e B, while also 
v A. c for all c E C. 

As a first-order structure, we shall regard an ^-graded digraph as a structure (G, — U a ) ae e, 
where (G, — is the underlying digraph and U a is a unary relation holding of the vertices 
with value a. 

Theorem 17. For any linear order £ and any cardinal 5 > \£\ for which 5 <s = 5, there is a 
5-homogenous l-graded digraph of size 5, which is universal for l-graded digraphs of size at 
most 5. This graph is unique up to isomorphism and satisfies <S -pattern property. 

Proof. Let Q be the collection of all finite ^-graded digraphs. This collection has the hered- 
itary property, the joint embedding property and the amalgamation property. Thus, by 
theorem 15 it has a <5-Fraisse limit A, which is a 5-homogenous ^-graded digraph of size 5, 
universal for the ^-graded digraphs in Q, and unique up to isomorphism with these proper- 
ties. Further, it is universal among all ^-graded digraphs of size at most 5. From this and 
^-homogeneity, it follows that A has the <5-pattern property. □ 

Theorem 18. If the global axiom of choice holds, then there is a set-homogeneous Ord-graded 
class digraph, which is universal for all Ord-graded class digraphs. 

Proof. Appeal to theorem 16. □ 



The remarks after theorem 20 show that this digraph admits a canonical construction in 
the manner of the surreal numbers. 
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6. The surreal numbers and the hypnagogic digraph 

In theorem 18 we constructed a set-homogeneous Ord-graded class digraph that was uni- 
versal for all Ord-graded digraphs. This digraph arises as the class Frai'sse limit of all 
Ord-graded finite digraphs, and it is unique up to isomorphism. In this section, I would like 
to describe how this graph also arises by a canonical construction intimately connected with 
the construction of the surreal numbers and what I call the hypnagogic digraph. 

Let us first recall the standard construction of the class of surreal numbers, denoted No. 
Here, I am concerned with the surreal numbers only as a linear order; the ordered field 
structure and other structure on the surreals will not be relevant here. 1 To construct the 
surreal number line, one begins with nothing and then proceeds relentlessly, transfinitely, 
to fill all possible cuts in the order created so far. One defines the class No of all surreal 
numbers and their order in a simultaneous recursion, for which a surreal number is born at 
an ordinal stage if it is represented by a pair {A | B }, where A and B are sets of previously- 
born surreal numbers, where every number in A is smaller than every number in B. The 
idea is that { A \ B } will be a surreal number filling the cut between A and B, so that it is 
larger than every element of A and smaller than every element of B. The very first surreal 
number to be created is = { | }, meaning by this notation that we have the empty set on 
each side, and then immediately afterwards the cut above 1 = { | }, meaning the singleton 
set on the left and the empty set on the right, and similarly the cut below — 1 = { | 0}. 
Just as with the rational numbers, however, the left- set /right- set { A | B } representation of 
the surreal numbers will not be unique, for different pairs of sets can determine the same 
cut, and so we will quotient by an underlying equivalence relation. Specifically, for surreal 
numbers x = {X L \ X R } and y = {Y L \Y R }, we define x < y if and only if no obvious 
obstacle prevents it, namely, there is no 6 X L with y < x L and no y R G Y R with y R < x. 
Two numbers x and y are equivalent when x < y < x, and in this case we say that x and y 
are equal as surreal numbers. It is easy to prove that a surreal number { A | B } is equivalent 
to the first-born surreal number that is bigger than every element of A and smaller than 
every element of B. 

One sees immediately the similarity in the construction of the surreal numbers and the 
construction of the proper class Frai'sse limit of theorem 16. And indeed, by those methods 
one can see that the surreal numbers are simply the proper class Frai'sse limit of the class 
of all finite linear orders. They are set-homogeneous and universal for all proper class linear 
orders (provided global choice holds, but one can construct them and get universality for set 
linear orders just in ZFC). In this sense, the surreal number line is for proper class linear 
orders what the rational number line Q is for the countable linear orders. What I aim to do 
in this section is to continue this analogy to find the corresponding proper class analogue of 
the countable random Q-graded digraph. The answer is what I call the hypnagogic digraph, 
denoted Hg. To explain the terminology, the hypnagogic state is the dream-like, sometimes 
hallucinatory state at the boundary between wakefulness and sleep, and its usage here is 
meant to evoke both the universal property of the graph as well as its homogeneity: peering 



Considerably pre-dating the surreal numbers, the study of what we now call the saturated linear orders 
was undertaken first by Hausdorff [Hau02], who introduced the notation r\ for the countable dense linear 
order and rj a for the saturated dense linear order of size H a , when = H Q . In this notation, the surreal 

number line is simply goid- 
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into the hypnagogic digraph, one may see all the various dancing visions of any given graded 
digraph. 

Theorem 19. There is surreal-numbers- graded class digraph Hg 7 the hypnagogic digraph, 
such that: 

(1) Hg is universal for all graded digraphs using any linear order, as a suborder of No. 

(2) Hg is set-homogeneous, meaning that any isomorphism of two induced graded sub- 
graphs extends to an automorphism o/Hg. 

(3) If global choice holds, then Hg is universal for all proper class graded digraphs, using 
any class linear order as a suborder of No, and furthermore is uniquely determined 
up to isomorphism by (1) and (2). 

(4) Hg exhibits the set-pattern property. 

Proof. Just as for the countable random Q-graded digraph in theorem 5, there are numerous 
constructions of this highly canonical object. For example, the hypnagogic digraph is the 
Fraisse limit of the class of all finite No-graded digraphs, as described in theorem 16. It is 
simply the Ord-saturated No-graded class digraph. But meanwhile, there is also a highly 
canonical construction of the hypnagogic digraph, closely connected with the construction 
of the surreal numbers, which I prefer to take as the official definition of the class Hg. The 
idea is simply to use the left-set/right-set representation { A \ B } of the surreal numbers, 
but do not quotient by the equivalence relation! Rather, the hypnagogic digraph is the term 
algebra of the terms representing surreal numbers. That is, the nodes in Hg are all the 
various representations { A \ B } of surreal numbers, counted as different nodes even if they 
are equal as surreal numbers. For a given node v = { A \ B }, we place edges a — 1 v and v — 1 b 
for every a & A and b G B, and we define the value of v to be simply the surreal number that 
it represents. Thus, the meaning of { A | B } as a node in Hg is that it is pointed at from 
every a G A; it points at every b G B; and its value is { A \ B } as a surreal number. Taking 
a big-picture view of the construction, the hypnagogic digraph is the result of systematically 
adding nodes with a specified edge connectivity with the already constructed nodes. Under 
this description, it is clear that Hg satisfies the set-pattern property, which in turn implies the 
universality property and thus the set-homogeneity by a class-length back and forth property. 
(Note: one need not use the global choice principle for the set-homogeneity of Hg, since any 
automorphism of the nodes up to a given birthday induces a canonical automorphism of the 
cuts created at that birthday.) One gets universality for all class ^-graded digraphs, using any 
linear order £, simply because under the global choice principle every linear order embeds as 
a suborder of No, and then one can systematically build an embedding of the given ^-graded 
digraph by mapping to a node representing the desired value and connectivity. Uniqueness 
follows similarly by a class-length back-and-forth construction. □ 

One way to view the construction of the hypnagogic digraph is that when a surreal number 
v = { A | B } is born, the meaning of this term is that v is definitely larger than every 
element of A and smaller than every element of B. The hypnagogic digraph takes this 
meaning seriously by adding digraph edges exactly to record this information. In this way, 
the hypnagogic digraph seems fundamentally to underlie the surreal numbers, in a sense 
existing just prior to the surreal numbers. 

To further support the idea that the hypnagogic digraph Hg is the proper class analogue 
of the countable random Q-graded digraph, observe that the hypnagogic digraph Hg M as 
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computed inside a countable model M \= ZFC is externally isomorphic to the countable 
random Q-graded digraph. This is because the surreal numbers No M as computed in M are 
a countable dense linear order without endpoints, and thus isomorphic to the rational line Q, 
and Hg M is thus essentially a countable Q-graded digraph with the finite-pattern property, 
which is isomorphic to the countable random Q-graded digraph by theorem 14. 

The surreal number line No is universal for all class linear orders. For any such suborder 
£ C No, we may form the hypnagogic l-graded digraph Hg \ £, simply by restricting the 
hypnagogic digraph to the nodes with value in £. 

Theorem 20. For any class linear order £ C No, the hypnagogic l-graded digraph Hg \ £ 
is set-homogeneous and universal for all £- graded digraphs (including class digraphs if global 
choice holds). Furthermore, it is uniquely determined by these properties. 

Proof. Clearly Hg \ £ is an ^-graded digraph, and it inherits the set-homogeneity of Hg, 
since any isomorphism of induced subgraphs of Hg \ £ is also an isomorphism of induced 
subgraphs of Hg, which therefore extends to an automorphism of all of Hg, which provides an 
automorphism of Hg \ £ since these automorphisms respect the values of nodes. Furthermore, 
Hg \ £ is universal for all ^-graded digraphs, since Hg is, by embeddings that respect the 
values of nodes. Uniqueness follows (assuming the global choice principle) by the usual 
back- and- forth argument. □ 

In particular, since the ordinals are a suborder of the surreal numbers, Ord C No, we have 
the hypnagogic Ord-graded digraph Hg \ Ord, which by the uniqueness claim of theorem 20 
is the same Ord-graded digraph constructed in theorem 18. One can view the hypnagogic 
Ord-graded digraph as constructed in a grand recursion: at each stage, for any ordinal a up 
to that stage and any sets A and B of previously constructed nodes, with those in A having 
value less than a and those in B above, one forms a new node v with value a and creates 
edges a — 1 v and v — b for every a e A and b e B. 

Note that if M is a countable transitive model of set theory, with height A = Ord M , 
then the hypnagogic Ord-graded digraph (Hg \ Ord) M as constructed in M is isomorphic by 
theorem 14 to the countable random A-graded digraph. 

7. Realizing well-founded digraphs as sets 

The following basic lemma, a modification of the Mostowski collapse construction, shows 
how to realize well-founded acyclic digraphs as sets. Similar modified Mostowski collapse 
maps have arisen for diverse purposes, such as in the non-well-founded set theory of Aczel 
[Acz88] and in the finite set theory of Kirby [KirlO] and elsewhere. 

Lemma 21. Every well-founded digraph (G,—^) is isomorphic to a set (A,e) under the 
set-membership relation e. Furthermore, 

(1) If G is finite, then there is a hereditarily finite such set A. 

(2) IfG is countable and \-graded for some ordinal \, then A can be found with A C V^+a- 
In particular, if u 2 < \, then there is such A C V\. 

(3) IfG is \-graded and \G\ < \Vp\, then A can be found with A C Vp + 2+\. In particular, 
if 1 ^ A, then there is such A C V\. 

(4) If G is countable and \-graded, with A infinite, then A can be found with A C La+a- 
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Proof. Suppose that (G, — is a well-founded digraph. Define 

7i(x) = {7r(y)\y^x}U{{dS,x}}, 

and let A = { n(x) | rr G G }. This definition is well-defined by recursion on the well-founded 
relation — since the value of n(x) is determined by the values of 7r(y) for earlier y — ^ 
x. I claim that it is an isomorphism of (G, — with (A, g). It is essentially similar, of 
course, to the Mostowski collapse, but modified by the inclusion of the extra element { 0, x }, 
which is added in order to distinguish the nodes sufficiently (In the case that the original 
graph is extensional, then the nodes are already distinguished by their predecessors and this 
modification is unnecessary, for the Mostowski collapse is already an isomorphism.) It is 
clear from the definition that y — x implies n(y) G ir(x). For the converse direction, note 
first that no 7r(x) is empty and since {0,x} is nonempty, it follows also that ^ ir(y) for 
any y. In particular, n(y) ^ { 0,x} for any x and y. Thus, if n{y) G 7r(a;), it must be that 
7r(y) = tt(z) for some z — 1 x. Since { 0, y } G 7r(y) and { 0, z } G 7r(z), but these sets are not 
tc(u) for any u, it follows that { 0, y } = { 0, -2 } and consequently y = z and thus actually 
y — So I have established for all x,y £ G that 

y ^ x if and only if 7r(y) G 7r(x), 

and this is precisely what it means for 7r to be an isomorphism of (G, — with (A, G). This 
establishes the basic claim of the lemma. 

I now consider the further claims by analyzing the nature of A. If G is finite, then we 
may assume that the underlying nodes of G are natural numbers (or some other hereditary 
finite sets), and in particular, the sets { 0, x } arising in the definition of n are all hereditarily 
finite. It follows inductively that every ir(x) is hereditarily finite, and so A is a finite set 
of hereditarily finite sets and thus hereditarily finite itself, establishing (1). Similarly, for 
(2), if G is countable, then we may again assume that the nodes of G are natural numbers 
and consider the isomorphism it as defined above. By induction on values in the grading, it 
follows for any node x with value a that tt(x) G V w+a , since { 0,x } G V u and ir(x) consists 
otherwise of n(y), where y has some value (3 < a and hence by induction ir(y) G V u+ p. So 
A C V u+ \. If oo 2 < A, then u + A = A, and so in this case A C V\. 

A similar argument works in the case of (3). Assume that G is A-graded and the nodes of 
G come from Vp. In this case, {0,a;} G Vg+i at worst, and inductively tt(x) G Vp + 2+ a when 
a; has value a. And so A C Vp + 2+\- If /3 2 < A, then j3 + 2 + A = A, and so we have A C Va, 
as desired. 

Assertion (4) is subtle. As in theorem 20, let A = (Hg f A) La be the hypnagogic A- 
graded digraph as defined in L\. Since this graph is universal for all countable A-graded 
digraphs, the original digraph G embeds into A, and so it will suffice just to handle A. This 
is a definable class in L\. Although the edge relation — of A is definable in L\, we will 
not be able to carry out the modified Mostowski collapse of A inside L\, because A has 
A many nodes of each given value, and indeed, every node in A has A many predecessors 
there. Thus, the sets arising at every step of the modified Mostowski collapse will have 
size A, and consequently will not necessarily be elements of L\. Another way to say it is 
that the edge relation — ^ of A is not set-like in L\, and this is why if we want to use this 
graph directly, we have to build sets on top of L\, stretching up to L x+ \. Specifically, let 
7r : A — > A be the modified Mostowski collapse as defined above. I claim that A C L\ + \, by 
arguing that any node x G A of value a has ir{x) G L\ +1+a+1 . This is true if x has value 0, 
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since x G L\ and n(x) = {{0,^}} G L\ +2 at worst (although if A is a limit ordinal, one 
achieves L\ here). More generally, if the claim is true for all values below a and x has value 
a, then ir(x) = { n(y) \ y — 1 x} U { { 0, x } }, where for y — ^ rr we have n(y) G La+i+/j for 
some (3 < a, and so 7r(a;) is a subset of L A+1+a , which is definable from A,x G L A , and so 
7r(x) G Lx+i+a+i- Thus, altogether, A C L x+ \. So we have found a set A C such that 
(A, g) is the countable random A-graded digraph. It now follows by the universality property 
of this digraph identified in theorem 12 that any countable A-graded digraph is isomorphic 
to a subset of this particular A, establishing (4). □ 

It will follow as a consequence of the main theorem, theorem 29, that we may actually 
improve statements (2) and (4) of lemma 21 to the following assertion: 

(5) If G is countable and A-graded with A infinite, then {G, — = (A, g) for some A C L\. 

Indeed, this will be the key point of the proof of the main theorem. Meanwhile, let us record 
in the following observation the reason we are interested in graded digraphs, as far as models 
of set theory are concerned. 

Observation 22. Every model of set theory (M, G M ) is an Or d M -graded digraph, using von 
Neumann rank as values. 

Proof. Surely Ord M is a linear order, and y G x implies that y has lower rank than x. □ 

8. Warming up to the main theorems 

In this section I explain how lemma 21 can be used to prove several approximations to 
main theorem 1, which I give here as a warm-up, since the proof of main theorem 1 will 
introduce several complications. 

Proposition 23. If M is any countable transitive model of set theory and A = Ord M 7 then 
(M, G) is isomorphic to a submodel of {L\ + \, G). 

Proof. This proposition is a quick corollary to statement (4) of lemma 21, since if we view 
(M, G M ) as a A-graded digraph as in observation 22, then lemma 21 statement (4) says that 
it is isomorphic to a submodel of (L x +\, G), as desired. □ 

The need to go to L\ + \ in proposition 23 and in lemma 21 statement (4) is directly 
connected with the fact that the hypnagogic A-graded digraph A as computed in L\ is not 
set-like in L\, since every node has A many predecessors of each smaller value. Perhaps one 
might hope to overcome this difficulty by finding a A-graded digraph in L\ that was both 
set-like and had the finite-pattern property, for in this case, it would be universal for all 
A-graded binary relations and we would be able to perform the modified Mostowski collapse 
of lemma 21 inside L\ itself. This would prove that M is isomorphic to a submodel of 
L\ = L M , which is the main goal here. Unfortunately, however, the following result shows 
that there is no such digraph in any model of ZF. 

Observation 24. No model of ZF has an Ord-graded digraph class T with the finite-pattern 
property, such that T is set-like. 

Proof. Suppose that T is a set-like Ord-graded digraph class with the finite-pattern property. 
Consider any fixed node p with nonzero value, and let A p be the set of predecessors of p in 
T, which is a set precisely because we assumed that T is set-like. Since A p has only a set 
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number of subsets, but there are a proper class of higher levels, it must be that there are 
two distinct nodes q, r with value above p, but with the same pattern of predecessors on A p . 
That is, for v G A p , we have v — q if and only if v — r. This violates the finite-pattern 
property, which would require the existence of a node v with v — ^ p and v ^ q but »lr. 
Such a vertex t> would reveal that g and r have different predecessors below p, contrary to 
assumption. □ 

The complications introduced into the proof of the main theorem, using the surrogate 
parent concept, are aimed specifically at overcoming this difficulty Meanwhile, we can also 
overcome the difficulty by relaxing to a weaker but still commonly considered theory. Let 
"V = H K +" be the theory asserting that re is the largest cardinal and that every set has 
hereditary size at most re. For any cardinal re, the collection H K + is a model of ZFC~ + V = 
H K + , and these structures and their elementary substructures are very commonly considered 
in set theory. 

Theorem 25. Suppose M is a countable transitive model of ZFC~ + V = H K +, where re <K = re 
in M and A = Ord M . Then there is a \-graded digraph A C M, which is set-like in M and 
which obeys the <K-pattern property in M . 

Proof. We modify the construction of the hypnagogic A-graded digraph so that it will become 
set-like, by restricting it to have re many nodes of any given value. Specifically, for each (3 < A 
of size re, consider the construction in M of the /3-graded hypnagogic digraph as the Fraisse 
limit of the finite /3-graded digraphs. This graph is an element of M and has the <re-pattern 
property in M. Furthermore, for any j3 < 7 < A and any copy of the /3-graded hypnagogic 
digraph Ap, it may be extended to the hypnagogic 7-graded digraph A 7 as constructed in 
M, and furthermore extended in such a way that is precisely A 7 \ j3. This is simply 
because A 7 \ (3 is re- homogeneous and has the < re-pattern property, and hence is isomorphic 
to A^. Continuing the main argument, now, I select an increasing sequence (A n | n < u) 
cofinal in A and form the limit graph A = [J n A\ n , where the A„ form a coherent sequence 
of induced subgraphs, with A An = A \ X n . The graph A is set-like in M, because every node 
lives in some A n , which is an element of M, and all its children are also in A n . Furthermore, 
A is A-graded and has the finite-pattern property. Moreover, each A n and consequently also 
A has the <re-pattern property with respect to M, since every subset of A that is in M is 
contained in some A n . □ 

Note that the final construction of A given in theorem 25 was external to M, using the 
sequence (A n | n < ou), although the graph was built as the union of initial segments that 
were elements of M and consequently A is at least amenable to M. But there seems little 
reason to expect for this construction that (M, G,A) satisfies ZFC~(A). Notice also that 
since A is countable, the digraph A of theorem 25 is exactly the countable random A-graded 
digraph. The important part of the theorem is that this digraph is set-like in M, which 
allows us to perform the modified Mostowski collapse of lemma 21. 

Corollary 26. If M is a countable transitive model of the theory ZFC~ + V = H K + and re <K = 
re in M and A = Ord M ; then there is a submodel ACM such that (A, e) is the countable 
random \-graded digraph. Consequently, every countable \-graded digraph is isomorphic to 
a submodel of (A, e). In particular, every countable transitive model of set theory of height 
at most A is isomorphic to a submodel of (M, G M ) . 
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Proof. By theorem 25, there is a set-like A-graded digraph A with the finite-pattern property, 
whose initial segments are sets in M. From this it follows that the modified Mostowski 
collapse of A as defined in lemma 21 is an isomorphism of (A, — = (A, e) for some A C M. 
Since A has the finite-pattern property, it and hence also {A, e) are the countable random A- 
graded digraph, which is universal for all countable A-graded digraphs. Thus, every countable 
A-graded digraph is isomorphic to a submodel of (A, g) and hence of (M, e). By observation 
22, this includes all countable models of set theory of well-founded height at most A. □ 

The key obstacle identified in the proof of observation 24 is the existence of more ordinals 
than the size of any given power set. The following theorem shows that this obstacle is the 
only obstacle, for if we are willing to give up the power set by adding many Cohen reals, we 
can find the desired universal digraph. 

Theorem 27. Assume V \= ZFC. If G C Add(w, Ord) is V- generic for the forcing to add 
Ord many Cohen reals, then V[G] \= ZFC~ has a submodel A C V[G] such that (A, e) is an 
Ord- graded digraph with the finite-pattern property. 

Proof. The idea is to build a set-like Ord-graded digraph A inside V[G\. The nodes of A will 
be precisely the elements of Ord xw, and (a, n) will have value a. What remains is to specify 
the edges, which will be done generically. Let P be the class partial order consisting of a 
finite digraph G, whose vertices are contained in Ord xu, ordered by the (reverse) induced 
subgraph relation G < H -h- H C G, so that as usual, stronger conditions are lower in the 
partial order. 

I claim that this partial order is isomorphic to the forcing Add(w, Ord) to add Ord many 
Cohen reals, which is the same as the forcing to add a generic class function g : Ord — > 2 
by finite conditions. One can see this simply by observing that the forcing P is deciding 
generically and independently for each pair of nodes whether to place an edge (a, n) — 1 (j3, k), 
where a < (3, or not. Thus, P is adding, by finite support, a generic function that decides of 
each possible edge, whether it is there or not. Thus, this forcing resonates with the random 
characterization of the countable random digraphs we saw earlier, but using genericity in 
place of randomness. 

Suppose that A is the resulting V-generic Ord-grade digraph. It is clear that we will 
achieve the finite-pattern property, since if A, B and C are finite subsets of the nodes and 
a is an ordinal such that every node in A has value less than a and every node in B has 
value larger than a, then it is a dense requirement that there is a node with value a that is 
above all nodes in A, below all nodes in B and with no edges in C. The reason is that any 
condition mentions only finitely many additional nodes, and there will be an unmentioned 
node on level a that we may simply add in so as to satisfy the desired pattern. 

Finally, the graph A is set-like in V[G], because the predecessors of (a, n) in A are amongst 
axw, which is a set. It follows that we may perform the modified Mostowski collapse of A 
as in lemma 21, and find A C L[G] such that (A, -^) = (A, e). □ 

Corollary 28. If M is a countable transitive model of set theory of height X, then M is 
isomorphic to a submodel of (L\[G],E), a forcing extension of L\ obtained by adding A 
many Cohen reals. 

Proof. By theorem 27, there is A C L[G] such that (A, e) is A-graded and has the finite- 
pattern property. By theorem 14, it follows that (A, e) is the countable random A-graded 
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digraph, which is universal for all countable A-graded digraphs. Since (M, g) is a A-graded 
digraph by observation 22, it follows that (M, g) is isomorphic to a submodel of (A, g) and 
hence to a submodel of g). □ 

This corollary will be improved by the main theorem, but it is already surprising, because 
the model M may have many large cardinals or other complex objects such as 0" that are 
fundamentally incompatible with V — L or V — L[G\. Nevertheless, the main theorem will 
omit the need for G and obtain the even more surprising result that M actually embeds as 
a submodel of L\ itself. 

9. Proving the main theorems 

Finally, I am ready to prove the main theorems 1 and 2. 

Theorem 29 (Main Theorem 1). Every countable model of set theory (M, G M ) is isomor- 
phic to a submodel of its own constructive universe (L M ,G M ). In other words, there is a 
quantifier- free- elementary embedding 

j:(M,e M )^(L M ,e M >. 

Proof. Suppose that M is a countable model of ZFC, not necessarily transitive or well- 
founded. Let A n G Ord M be an increasing cofinal sequence of limit ordinals from M. By the 
reflection theorem, we may assume without loss of generality that V^ M -<s n M, although the 
argument will not really use much of this. Let T n be the hypnagogic (A n + l)-graded digraph 
as constructed in , running the construction for A n+ i many birthdays. Thus, each value 
level of T n has A ra+ i many elements, and T n has a top layer consisting of the nodes with value 
A n . Furthermore, each T n has the finite-pattern property, each T n is an induced subgraph of 
r n+ i and the union T = \J n T n of the chain is exactly the hypnagogic Ord-graded digraph of 
L M . Although each r n is a set in L M , the union digraph T is not set-like in L M , since every 
node in r n gains A„ + 2 many new children in r n+1 . 

I shall presently use the graphs T n in order to construct a somewhat more elaborate graph, 
which I call the surrogate digraph, which will be set-like and which will exhibit a kind of 
finite-pattern property that will be sufficient for universality. Define that (t> , • • • ,v n ) is a 
surrogate sequence of its final node v n , if v n G r n , the value of v n is at least sup fe<n A^ and 
each Vk for k < n is a node in Tk of value A&. For surrogate sequences v = (vq, . . . , v n ) and 
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Figure 2. Surrogate sequences in the surrogate digraph 

w = (wo, . . . ,w m ), define the surrogate edge relation w v to hold if and only if m < n 
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and w m — 1 v m . The idea here is that we split the parent and child roles in T, so that only 
the final node w m of a surrogate sequence (wq, . . . , w m ) acts as a child, while the prior nodes 
w k act as surrogate parents, as far as gaining new children below value \ k is concerned. It 
may be helpful to think of the relation that a set x has with its projections x D V a for a of 
much smaller von Neumann rank than x, for we can tell if y G x or not, when y has rank less 
than a, merely by knowing whether or not y G x n V Q . In our final analysis, the surrogates 
v k will be something like (but not exactly like) these projections, to capture the elements of 
rank less than \ k , and to do so without leaving T k . This surrogate maneuver will allow us 
to surmount the obstacle of observation 24. 

Let n be the (A n + l)-graded digraph consisting of all surrogate sequences of length at 
most n, using -» as the edge relation, and giving the surrogate sequence the value of its final 
node in T. We will refer to the union digraph = |J n O n as the Ord M -graded surrogate 
digraph arising from (A n | n < u>). Because we can define the sequence (A n | n < u>) from 
O, we cannot expect that it is a class in M. Nevertheless, each n is a set in M and an 
induced subgraph of 0, the restriction of to surrogate sequences having value at most A n . 
It follows that the full surrogate digraph is an amenable class over M and indeed over L M . 
In particular, is set-like in L M , since any particular surrogate sequence w lies in some n , 
which is a set in L M and is the (A„ + l)-initial segment of 0. 

Let's define that two subsets of are completely disjoint if they are disjoint and further- 
more any two surrogate sequences from either of them have no nodes in common. That is, 
not only are they disjoint as sets of sequences, but the individual nodes on those sequences 
do not recur. 

Lemma 29.1. The surrogate digraph enjoys the surrogate finite-pattern property: if A, B 
and C are completely disjoint finite subsets of and a is an ordinal of M , such that every 
element of A has value less than a and every element of B has value greater than a, then 
there is v G with value a such that a -» v and v -» b for all a £ A and b G B, and there 
are no -» relations between v and elements of C , and furthermore, such that no nodes of v 
arise on any sequence from A, B or C . 

Proof. Let n be least such that a < A n . We shall first choose the terminal node v n of 
the desired surrogate sequence v = (vq, . . . , v n ). Let B n consist of the nodes arising in the 
sequences of B with value in the interval (a,X n ]. These nodes arise either as the terminal 
nodes of sequences in B that terminate with a node of value at most A„, or else they arise 
as the surrogate nodes at level A n of a longer sequence. But in any case, B n C T n , and 
furthermore, for v to exhibit the correct -» relation to the elements of B, it will suffice that 
v n — b for each b G B n ; and the main point of surrogates is that we shall be able to find 
such a v n inside r ra , without care for the much larger value nodes that may appear later on 
in the sequences of B. But we must also ensure the correct -» relation to A and C, so let 
A n be the terminal nodes of any sequence in A having value at least sup k<n \ k , and let C n 
be the nodes occuring on a sequence in C that are in r n . Since A, B and C are completely 
disjoint, it follows that A n , B n and C n are disjoint subsets of T n , and every node in A n has 
value below a and every node in B n has value above a. Thus, by the finite-pattern property 
of T n , there is a new node v n G r„ such that a — v n and v n — 1 b for every a G A n and b G B n 
and such that v n has no — ^ relation with any node of C n . Next, to define v k for k < n, let 
A k be the nodes arising from the sequences in A in the A; th layer, that is, with value in the 
interval [sup i<A . Aj, \ k ), and similarly let C k be any node on any sequence in C in T k . (We 
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need not consider B for defining the surrogates Vk for k < n, since these surrogates act only 
as parents and not as children with respect to -».) By the finite-pattern property of F^, 
there is a new node Vk such that a — Vk for each a <E Ak and Vk has no — 1 relation with any 
node in C^. This defines v = (v , . . . ,v n ), which has value a and which consists of entirely 
new nodes not arising in A, B and C . Furthermore, by design, a -» v for each a G A and 
v -» b for each b £ B, whilst v has no -» relation with any sequence in C, and so we have 
fulfilled this instance of the surrogate finite-pattern property. □ 

So what we have is a set-like Ord M -graded digraph C L M , which does not have the 
finite-pattern property (and cannot by observation 6), but which has the surrogate finite- 
pattern property, and this is sufficient to carry out the universality construction. Specifically, 
we assign to each x G M a surrogate sequence v x , with the same value as rank(x), ensuring 
that x G y v x -» v y . This can be achieved by enumerating M = {x n \ n < u}, and 
choosing each v Xn so as to relate via -» to the previous v Xk for k < n in exactly the same way 
that x n relates to x k via G, while also having the right value and inductively maintaining 
that the sequences v Xn have no individual nodes in common. The surrogate finite-pattern 
property of the lemma exactly ensures that this recursive construction may proceed, and 
thus we build an embedding of (M, G M ) into an induced subgraph of (O, -»). 

Finally, I argue that (6, ^) = (A, G M ) for some A C L M , using the fact that -» is 
set-like in L M . Specifically, since each O n is a set in L M , where -» is well-founded, we 
may inside L M carry out the modified Mostowski collapse of lemma 21, defining it(v) = 
{ tt(w) I w -» v} U {{0,-w}} to find (0 n , -») = (A n , G M ) G L M . Furthermore, since each 
O n is a -» initial segment of 0, it follows that these various isomorphisms cohere, and so 
(0, -») ^ (A, G M >, where A = [j n A n C L M . 

Combining the two previous paragraphs, we observe that M is isomorphic to an induced 
subgraph of 0, which is isomorphic to a submodel of L M , and so M is isomorphic to a 
submodel of L M , as desired. For the final claim of the theorem, note that an isomorphism 
of M to a submodel of L M is precisely the same as a quantifier-free-elementary embedding 
j : M —> L M . ' ' □ 

Corollary 30. Every countable model (M, G M ) of set theory is universal for all countable 
Ord M -graded binary relations. 

Proof. This is what the proof of theorem 29 establishes. The surrogate finite-pattern property 
of ensures that it is universal for all countable Ord M -graded binary relations. □ 

Corollary 31. A countable model (M, G M ) of set theory embeds into another model (N, E N ) 
of set theory if and only if the ordinals of M map order-preservingly into the ordinals of N. 

Proof. We needn't assume iV is countable here, since we may simply pass to a countable 
submodel. For the forward implication, if j : M — > N is an G-embedding, then f(a) = 
rank(j(o;)) is an order-preserving map from Ord M to Ord^. Conversely, if the ordinals of 
M map into the ordinals of N, then we may externally view (M, G M ) as an Ord^-graded 
digraph, and the previous corollary shows that (N, <E N ) is universal for such relations. □ 

Corollary 32. Every ill-founded countable model of set theory (M, G M ) is universal for all 
countable acyclic binary relations. 

Proof. If M is not an w-model, then already the hereditary finite sets HF M of M are universal 
by theorem 9. If M has a standard u>, but the ordinals are ill-founded, then by results in 
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[Fri73], it follows that OrcT = A • (1 + Q) for some admissible ordinal A. In particular, Ord 
contains a countable dense order, and so by lemma 4 every acyclic binary relation can be 
Ord M -graded. Thus, (M, G M ) is universal for all such relations. □ 

Theorem 33 (Main Theorem 2). The countable models of set theory are linearly pre-ordered 
by embeddability: for any two countable models of set theory (M, G M ) and (N, <E N ) , either M 
is isomorphic to a submodel of N or conversely. Indeed, the countable models of set theory 
are pre-w ell- ordered by embeddability in order type exactly Ui + 1. 

Proof. Corollary 32 shows that the models having ill-founded ordinals are all universal, so 
that every countable model of set theory embeds into any of them. In particular, they are 
all bi-embeddable with each other. What remains are the well-founded models, which by 
corollary 31 are ordered the same as their ordinals. In particular, any two well-founded 
models with the same ordinals are bi-embeddable, and any shorter model is embeddable into 
any taller model. 

Thus, the bi-embeddability classes of the models of set theory are in one-to-one corre- 
spondence with the collection of ordinals A such that L\ is a model of set theory, plus one 
more bi-embeddability class at the top for the ill-founded models. Since there are u± many 
countable admissible ordinals, it follows that the order-type of the bi-embeddability classes 
of the countable models of set theory (understood as models of KP) is precisely ui + 1. □ 

If one wants to consider only the countable models of ZFC, or only the models of ZFC 
plus large cardinals, then the argument shows that they are pre-well-ordered in order type 
T) + l, where rj is the number of countable ordinals A for which there is a model of the theory 
having height A. If there is an inaccessible cardinal, for example, then it follows that rj = io\ 
for ZFC models and the countable models of ZFC are pre-well-ordered by embeddability in 
order type iO\ + 1. 

Finally, let me observe that one definitely doesn't have linearity for the embeddability 
relation in the context of uncountable models. 

Observation 34. // ZFC is consistent, then there are two models M, N \= ZFC, such that 
neither is isomorphic to a submodel of the other. 

Proof. If ZFC is consistent, then on the one hand, there is an uncountable model M \= ZFC 
with cofinality ui and with every cut of cofinality u>. On the other hand, one may also build 
an wi-like model N |= ZFC, so N is uncountable, but every initial segment is countable. It 
follows that N has cofinality U\. 

Note that M cannot embed into N, since there are elements of M with uncountably many 
predecessors, but every element of N has only countably many predecessors. And N cannot 
embed into M, since the supremum of the image would be a cut of uncountable cofinality. □ 

10. Questions 

Although the main theorem shows that every countable model of set theory embeds into 
its own constructible universe 

J : M -> L M , 

this embedding j is constructed completely externally to M and there is little reason to 
expect that j could be a class in M or otherwise amenable to M. To what extent can we 
prove or refute the possibility that j is a class in Ml This amounts to considering the matter 
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internally as a question about V. Surely it would seem strange to have a class embedding 
j : V — > L when V ^ L, even if it is elementary only for quantifier-free assertions, since such 
an embedding is totally unlike the sorts of embeddings that one usually encounters in set 
theory. Nevertheless, I am at a loss to refute the hypothesis, and the possibility that there 
might be such an embedding is intriguing, if not tantalizing, for one imagines all kinds of 
constructions that pull structure from L back into V. 

Question 35. Can there be an embedding j : V — >■ L when V ^ L? 

By embedding, I mean an isomorphism from (V, g) to its range in (L, G), which is the same 
as a quantifier-free-elementary map j : V — > L, a map for which x G y j(x) G j(y)- (Note 
that if V — L, then we have some easily defined non-identity embeddings j : L — >■ L arising 
from lemma 21, such as j(x) — { j(y) \ y G x } U { { 0, x } }, which has j(x) ^ x for every x.) 
Question 35 is most naturally formalized in Godel-Bernays set theory, asking whether there 
can be a GB-class j forming such an embedding. If one wants j : V — > L to be a definable 
class, then this of course implies V = HOD, since the definable L-order can be pulled back 
to V, via x < y j(s) <l j(y)- More generally, if j is merely a class in Godel-Bernays 
set theory, then the existence of an embedding j : V — > L implies global choice, since from 
the class j we can pull back the L-order. For these reasons, we cannot expect every model 
of ZFC or of GB to have such embeddings. Can they be added generically? Do they have 
some large cardinal strength? Are they outright refutable? 

It they are not outright refutable, then it would seem natural that these questions might 
involve large cardinals; perhaps 0" is relevant. But I am unsure which way the answers will 
go. The existence of large cardinals provides extra strength, but may at the same time make 
it harder to have the embedding, since it pushes V further away from L. For example, it is 
conceivable that the existence of 0" will enable one to construct the embedding, using the 
Silver indiscernibles to find a universal submodel of L; but it is also conceivable that the 
non-existence of 0", because of covering and the corresponding essential closeness of V to L, 
may make it easier for such a j to exist. Or perhaps it is simply refutable in any case. The 
first-order analogue of the question is: 

Question 36. Does every set A admit an embedding j : (A, G) — > (L, G) ? If not, which sets 
do admit such embeddings? 

The main theorem shows that every countable set A embeds into L. What about uncount- 
able sets? Let us make the question extremely concrete: 

Question 37. Does (V w+1 , G) embed into (L, G) ? How about (P{u), G) or (HC, G) ? 

It is also natural to inquire about the nature of j : M — > L M even when it is not a class 
in M. For example, can one find such an embedding for which j(a) is an ordinal whenever 
a is an ordinal? The embedding arising in the proof of theorem 29 definitely does not have 
this feature, since every set in M is mapped to a node in O, which is mapped ultimately to 
ir(v) = { ir(w) | w -» v } U {{0, v}}, and this latter set is never an ordinal. 

Question 38. Does every countable model (M, G M ) of set theory admit an embedding j : 
M — > L M that takes ordinals to ordinals? 

Probably one can arrange this simply by being a bit more careful with the modified 
Mostowski procedure in lemma 21, as it is applied to O in the proof of the main theorem, 
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theorem 29. And if this is correct, then numerous further questions immediately come to 
mind, concerning the extent to which we ensure more attractive features for the embeddings 
j that arise in the main theorems. This will be particularly interesting in the case of well- 
founded models, as well as in the case of j : V — > L, as in question 35, if that should be 
possible. 

Question 39. Can there be a nontrivial embedding j : V — > L that takes ordinals to ordinals? 

Finally, we inquire about the extent to which the main theorems of this article can be 
extended from the countable models of set theory to the Wi-like models: 

Question 40. Does every u\-like model of set theory (M, G M ) admit an embedding j : M — > 
L M into its own constructible universe? Are the u\-like models of set theory linearly pre- 
ordered by embeddability? 
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